Developing a Protein Interaction Prediction Algorithm on HPC
نویسندگان
چکیده
The prediction of protein-protein interaction is one of the fundamental problems in bioinformatics. A novel algorithm called STRIKE has shown to achieve good performance in protein-protein interaction prediction. It assumes that proteins interact if they contain similar substrings of amino acids. In this paper, we developed a parallel STRIKE algorithm and we implemented our proposal on Cluster system. Using short protein sequence sets, the overall execution time of a parallel implementation of this bioinformatics algorithm was decreased to about 5 times when increasing number of nodes from one compute node to 6 parallel nodes. Key optimizations to the implementation are also discussed. Keywords— protein-protein sequence matching; parallel computing; performance analysis; HPC computing; sequence
منابع مشابه
Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks
Background: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from pro...
متن کاملDiscovering Domains Mediating Protein Interactions
Background: Protein-protein interactions do not provide any direct information regarding the domains within the proteins that mediate the interactions. The majority of proteins are multi domain proteins and the interaction between them is often defined by the pairs of their domains. Most of the former studies focus only on interacting domain pairs. However they do not consider the in...
متن کاملPrediction of Coffee Effects in Rats with Healthy and NAFLD Conditions Based on Protein-Protein Interaction Network Analysis
Background and objectives: Non-alcoholic fatty liver disease (NAFLD) is a common liver condition. On the other hand, coffee consumption has shown promising for gastrointestinal diseases. Detection of the most valuable biomarkers of decaffeinated coffee treatment in healthy and non-alcoholic fatty liver disease conditions was the aim of the present study. Methods:</stro...
متن کاملInverse protein folding in 3D hexagonal prism lattice under HP model
The inverse protein folding problem is that of designing an amino acid sequence which has a prescribed native protein fold. This problem arises in drug design where a particular structure is necessary to ensure proper protein-protein interactions. Previously, tubular structures for a three-dimensional (3D) hexagonal prism lattice were introduced and their stability was formally proved for simpl...
متن کاملDeveloping a Dynamic Regression Model for Predicting Future Operating Cash Flow
The purpose of this research is to develop a dynamic regression model for prediction of future operating cash flows of firms accepted in Tehran Stock Exchange. So, the information of 250 companies were considered during 2004 to 2017. In this study, operational and economic variables were added to the fundamental model of Bart, Cram and Nelson (BCN). Due to the simultaneous effect of sales growt...
متن کامل